( 1 ) GENERAL INFORMATION:

( i i i ) NUMBER OF SEQUENCES: 12



SEQUENCE LISTING 



( 2 ) INFORMATION FOR SEQ ID NO: 1: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 275 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Aspergillus giganteus

( i x ) FEATURE:

( A ) NAME/KEY: 5'UTR
( B ) LOCATION: 1..45

( i x ) FEATURE:

( A ) NAME/KEY: CDS
( B ) LOCATION: 46..225
( C ) IDENTIFICATION METHOD: experimental
( D ) OTHER INFORMATION: /codon_start= 46
/function= "antifungal agent"
/product= "antifungal peptide)"
/evidence= EXPERIMENTAL
/note= "antifungal agent, especially on
Rhizoctonia solani, various Aspergillus, Fusaria
and Trichophyton species"

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TTGCCACCCC CGTTGAAGCC GATTCTCTCA CCGCTGGTGG TCTGG ATG CAA GAG 54
Met Gln Glu
1

ATG AGA GCG CGG GTT TTG GCC ACA TAC AAT GGC AAA TGC TAC AAG AAG 102
Met Arg Ala Arg Val Leu Ala Thr Tyr Asn Gly Lys Cys Tyr Lys Lys
5 10 15

GAT AAT ATC TGC AAG TAC AAG GCA CAG AGC GGC AAG ACT GCC ATT TGC 150
Asp Asn Ile Cys Lys Tyr Lys Ala Gln Ser Gly Lys Thr Ala Ile Cys
20 25 30 35

AAG TGC TAT GTC AAA AAG TGC CCC CGC GAC GGC GCG AAA TGC GAG TTT 198
Lys Cys Tyr Val Lys Lys Cys Pro Arg Asp Gly Ala Lys Cys Glu Phe
40 45 50

GAC AGC TAC AAG GGG AAG TGC TAC TGC TAGACGGTGA GCGAAGGGAC 245
Asp Ser Tyr Lys Gly Lys Cys Tyr Cys
55 60

GAAGTAGGCT GGGGGTTATT TTACTCTGCT 275



( 2 ) INFORMATION FOR SEQ ID NO: 2:

( i ) SEQUENCE CHARACTERISTICS:
( A ) LENGTH: 60 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Gln Glu Met Arg Ala Arg Val Leu Ala Thr Tyr Asn Gly Lys Cys
1 5 10 15

Tyr Lys Lys Asp Asn Ile Cys Lys Tyr Lys Ala Gln Ser Gly Lys Thr
20 25 30

Ala Ile Cys Lys Cys Tyr Val Lys Lys Cys Pro Arg Asp Gly Ala Lys
35 40 45

Cys Glu Phe Asp Ser Tyr Lys Gly Lys Cys Tyr Cys
50 55 60

( 2 ) INFORMATION FOR SEQ ID NO: 3: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 51 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( v ) FRAGMENT TYPE: C-terminal

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Aspergillus giganteus

( i x ) FEATURE:

( A ) NAME/KEY: Protein
( B ) LOCATION: 1..51
( D ) OTHER INFORMATION: /note= "active protein fragment of
AFP"

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ala Thr Tyr Asn Gly Lys Cys Tyr Lys Lys Asp Asn Ile Cys Lys Tyr
1 5 10 15

Lys Ala Gln Ser Gly Lys Thr Ala Ile Cys Lys Cys Tyr Val Lys Lys
20 25 30

Cys Pro Arg Asp Gly Ala Lys Cys Glu Phe Asp Ser Tyr Lys Gly Lys
35 40 45

Cys Tyr Cys
50

( 2 ) INFORMATION FOR SEQ ID NO: 4: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 1032 base pairs 
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: cDNA

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Hordeum vulgare
( B ) STRAIN: L. cv. Piggy

( v i i ) IMMEDIATE SOURCE:

( A ) LIBRARY: cDNA gene bank in lambda-gt-11-phages

( i x ) FEATURE:

( A ) NAME/KEY: 5'UTR
( B ) LOCATION: 1..42

( i x ) FEATURE:

( A ) NAME/KEY: CDS
( B ) LOCATION: 43..885
( D ) OTHER INFORMATION: /codon_start= 43
/function= "antifungal activity"
/product= "protein synthesis inhibitor (PSI)"
/note= "antifungal activity, especially on spores
of Trichoderma reesii and Fusarium sporotrichoides
and on Rhizoctonia solani."

( i x ) FEATURE:

( A ) NAME/KEY: 3'UTR
( B ) LOCATION: 886..1032
( D ) OTHER INFORMATION: /partial
/note= "46 nucleotides at the 3'-end not shown."

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 930..935
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 963..976
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 1002..1011
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: mat_peptide
( B ) LOCATION: 46..886

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



CTTAATAGCA CATCTTGTCC GTCTTAGCTT TGCATTACAT CC ATG GCG GCA AAG 54
Met Ala Ala Lys

ATG GCG AAG AAC GTG GAC AAG CCC CTC TTC ACC GCG ACG TTC AAC GTC 102
Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala Thr Phe Asn Val
10 15 20

CAG GCC AGC TCC GCC GAC TAC GCC ACC TTC ATC GCC GGC ATC CGC AAC 150
Gln Ala Ser Ser Ala Asp Tyr Ala Thr Phe Ile Ala Gly Ile Arg Asn
25 30 35

AAG CTC CGC AAC CCG GCG CAC TTC TCC CAC AAC CGC CCC GTG CTG CCG 198
Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg Pro Val Leu Pro
40 45 50

CCG GTC GAG CCC AAC GTC CCG CCG AGC AGG TGG TTC CAC GTC GTG CTC 246
Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe His Val Val Leu
55 60 65

AAG GCC TCG CCG ACC AGC GCC GGG CTC ACG CTG GCC ATT CGG GCG GAC 294
Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala Ile Arg Ala Asp
70 75 80



AAC 
Asn 
8 5 

CTC 
Leu 



ACC 
Th r 



CTC 
Leu 



ACC 
T b r 



ATC 
I I e 



ACC 
T h i 



TAC 
Ty r 



GGC 
G 1 v 



AAG 

Lys 
1 5 0 



TAC 
Ty i 



CCG 
Pro 



CGC 
Arg 



CGG 
A r g 
1 3 5 

GCC 
A 1 a 



CTG 
L e u 



GGC 
G I y 



GAC 
Asp 
1 2 0 

CAG 
G 1 n 



GAC 
Asp 



GAG GGC TTC AAG 

Glu Gly Phe Lys 
9 0 

CTC ATC CCC GGC 

Leu lie Pro Gly 
1 0 5 

CTC CTC GGC GAC 

Leu Leu Gly Asp 



AGC AGC 
Ser Scr 



CAG 
G 1 o 



AAG 
Lys 



ICTC CCgI GAC 
Leu P r o 1 Asp 

l 4 0 

Leu Ala 
CCG TCC GGC 
P r o S c r Gly 
15 5 



R>GC7 ACC 
le I T.I T h r 

GCC "i i o 
Ala 

ACC GAC 

Tbr Asp 
1 2 5 

GCG GTG 

Ala Val 



CCG AAG 
Pro Lys 



GAC GGC ACC TGG TGG GAG 

Asp Gly Tbr Trp Trp Glu 

9 5 10 0 

TAC GTC GGG TTC GGC GGC 

Tyr Val Gly Phe Gly Gly 

1 1 5 

AAG CTG ACC AAC GTC GCT 

Lys Leu Thr Asn Val Ala 

1 3 0 

ACC GCC CTC CAC GGG CGC 

Tbr Ala Leu His Gly Arg 
1 4 5 

CAG CAG CAG GCG AGG GAG 

Glu Gin Gin Ala Arg Glu 
1 6 0 



3 4 2 



3 9 0 



4 3 8 



4 8 6 



5 3 4 



GCG GTG ACG ACG CTG CTC CTC ATG GTG AAC GAG GCC ACG CGG TTC CAG 582
Ala Val Thr Thr Leu Leu Leu Met Val Asn Glu Ala Thr Arg Phe Gln
165 170 175 180

ACG GTG TCT GGG TTC GTG GCC GGG TTC CTG CAC CCC AAG GCG GTG GAG 630
Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys Ala Val Glu
185 190 195

AAC AAG AGC GGG AAG ATC GGC AAT GAG ATG AAG GCC CAG GTG AAC GGG 678
Lys Lys Ser Gly Lys Ile Gly Asn Glu Met Lys Ala Gln Val Asn Gly
200 205 210

TGC CAG GAC CTC TCC GCG GCG CTG CTG AAG ACG GAC CTG AAG CCT CCG 726
Trp Gln Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val Lys Pro Pro
215 220 225

CCG GGA AAG TCG CCA GCG AAG TTC GCG CCG ATC GAG AAG ATG GGC GTG 774
Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro Ile Glu Lys Met Gly Val
230 235 240

AGG ACG GCT GTA CAG GCC GCC AAC ACG CTG GGG ATC CTG CTG TTC GTG 822
Arg Thr Ala Val Gln Ala Ala Asn Thr Leu Gly Ile Leu Leu Phe Val
245 250 255 260

GAG GTG CCG GGT GGG TTG ACG GTG GCC AAC GCG CTG GAG CTG TTC CAT 870
Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu Leu Phe His
265 270 275

GCG AGT GGT GGG AAA TAGGTAGTTT TCCAGGTATA CCTGCATGGG TAGTGTAAAA 925
Ala Ser Gly Gly Lys
280

GTCGAATAAA CATGTCACAG AGTGACGGAC TGATATAAAT AAATAAATAA ACGTGTCACA 985
GAGTTACATA TAAACAAATA AATAAATAAT TAAAAATGTC CAGTTTA 1032
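
The three polyA_signal features annotated for SEQ ID NO: 4 fall inside the 3' untranslated region printed above. As a rough consistency check, and not as part of the original listing, the following Python sketch slices those 1-based inclusive locations out of the printed 3'UTR text and tests whether each contains the canonical AATAAA hexamer. The names `UTR3_START`, `UTR3_TEXT`, and `feature` are illustrative assumptions; the nucleotide text is copied from the rows for positions 886 to 1032.

```python
# Sketch only: check the annotated polyA_signal locations of SEQ ID NO: 4
# against the 3'UTR text printed in the listing (positions 886..1032).
UTR3_START = 886  # 1-based coordinate of the first nucleotide in UTR3_TEXT

UTR3_TEXT = """
TAGGTAGTTT TCCAGGTATA CCTGCATGGG TAGTGTAAAA
GTCGAATAAA CATGTCACAG AGTGACGGAC TGATATAAAT AAATAAATAA ACGTGTCACA
GAGTTACATA TAAACAAATA AATAAATAAT TAAAAATGTC CAGTTTA
"""

# Remove the block spacing and line breaks used in the printed listing.
utr3 = "".join(UTR3_TEXT.split())


def feature(start: int, end: int) -> str:
    """Return the subsequence for a 1-based inclusive location start..end."""
    return utr3[start - UTR3_START : end - UTR3_START + 1]


# Locations as given in the FEATURE entries of SEQ ID NO: 4.
POLYA_FEATURES = [(930, 935), (963, 976), (1002, 1011)]
hits = {loc: feature(*loc) for loc in POLYA_FEATURES}

for (start, end), sub in hits.items():
    print(f"{start}..{end}: {sub} contains AATAAA: {'AATAAA' in sub}")
```

Keeping the printed spacing in the string and stripping it in code avoids retyping the sequence by hand, which is where transcription errors in a listing like this tend to creep in.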

( 2 ) INFORMATION FOR SEQ ID NO: 5: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 281 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Lys Met Ala Lys Asn Val Asp Lys Pro Leu Phe Thr Ala
1 5 10 15

Thr Phe Asn Val Gln Ala Ser Ser Ala Asp Tyr Ala Thr Phe Ile Ala
20 25 30

Gly Ile Arg Asn Lys Leu Arg Asn Pro Ala His Phe Ser His Asn Arg
35 40 45

Pro Val Leu Pro Pro Val Glu Pro Asn Val Pro Pro Ser Arg Trp Phe
50 55 60

His Val Val Leu Lys Ala Ser Pro Thr Ser Ala Gly Leu Thr Leu Ala
65 70 75 80

Ile Arg Ala Asp Asn Ile Tyr Leu Glu Gly Phe Lys Ser Ser Asp Gly
85 90 95

Thr Trp Trp Gin Leu Thr Pro Gly Leu Me Pro G I y 1 y] T h r Tyr Val 

1 0 0 1 0 5 Aj| 110 

Gly Phe Gly Gly Thr Tyr Arg Asp Leu Leu Gly Asp Thr Asp Lys Leu
115 120 125

Tbr Asn Val Ala Leu Gly Arg Gin Gin Lcu^rojAsp Ala Val Tbr Ala 

1 3 0 1 3 5 Ala 1 4 0 

Leu His Gly Arg Thr Lys Ala Asp Lys Pro Ser Gly Pro Lys Gln Gln
145 150 155 160

Gln Ala Arg Glu Ala Val Thr Thr Leu Leu Leu Met Val Asn Glu Ala
165 170 175

Thr Arg Phe Gln Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro
180 185 190

Lys Ala Val Glu Lys Lys Ser Gly Lys Ile Gly Asn Glu Met Lys Ala
195 200 205

Gln Val Asn Gly Trp Gln Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp
210 215 220

Val Lys Pro Pro Pro Gly Lys Ser Pro Ala Lys Phe Ala Pro Ile Glu
225 230 235 240

Lys Met Gly Val Arg Thr Ala Val Gln Ala Ala Asn Thr Leu Gly Ile
245 250 255

Leu Leu Phe Val Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu
260 265 270

Glu Leu Phe His Ala Ser Gly Gly Lys



( 2 ) INFORMATION FOR SEQ ID NO: 6: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 480 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: cDNA

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Hordeum vulgare
( B ) STRAIN: L. cv. Piggy

( v i i ) IMMEDIATE SOURCE:

( A ) LIBRARY: cDNA gene bank in lambda-gt-11-phages
( B ) CLONE: incomplete psi cDNA clone

( i x ) FEATURE:

( A ) NAME/KEY: CDS
( B ) LOCATION: 1..351
( D ) OTHER INFORMATION: /partial
/codon_start= 1
/function= "protein synthesis inhibitor"
/product= "protein synthesis inhibitor"
/standard_name= "PSI"
/note= "aminoterminally incomplete protein from
an incomplete PSI cDNA clone"

( i x ) FEATURE:

( A ) NAME/KEY: 3'UTR
( B ) LOCATION: 352..487

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 404..409
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 437..442
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 445..450
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GCG GTG ACG ACG CTG CTC CTC ATG GTG AAC GAG GCC ACG CGG TTC CAG 48
Ala Val Thr Thr Leu Leu Leu Met Val Asn Glu Ala Thr Arg Phe Gln
1 5 10 15

ACG GTG TCG GGG TTC GTG GCC GGG CTG CTG CAC CCC AAG GCG GTG GAG 96
Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys Ala Val Glu
20 25 30

AAG AAG AGC GGG AAG ATC GGC AAT GAG ATG AAG GCC CAG GTG AAC GGG 144
Lys Lys Ser Gly Lys Ile Gly Asn Glu Met Lys Ala Gln Val Asn Gly
35 40 45

TGG CAG GAC CTG TCC GCG GCG CTG CTG AAG ACG GAC GTG AAG CCC CCG 192
Trp Gln Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val Lys Pro Pro
50 55 60

CCC GGA AAG TCG CCA GCG AAG TTC ACG CCG ATC GAG AAG ATG GGC GTG 240
Pro Gly Lys Ser Pro Ala Lys Phe Thr Pro Ile Glu Lys Met Gly Val
65 70 75 80

AGG ACT GCT GAG CAG GCT GCG GCT ACT TTG GGG ATC CTG CTG TTC GTT 288
Arg Thr Ala Glu Gln Ala Ala Ala Thr Leu Gly Ile Leu Leu Phe Val
85 90 95

GAG GTG CCG GGT GGG TTG ACG GTG CCC AAG GCG CTG GAG CTG TTT CAT 336
Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu Leu Phe His
100 105 110

GCG AGT GGT GGG AAA TAGGTAGTTT TGCAGGTATA CCTGCATGGG TAAATGTAAA 391
Ala Ser Gly Gly Lys
115

AGTCGAATAA AAATGTCACA GAGTGACGGA CTGATATAAA TAAATTAATA AACATGTCAT 451

CATGAGTGAC AGACTGATAT AAATAAATA 480

( 2 ) INFORMATION FOR SEQ ID NO: 7: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 117 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Val Thr Thr Leu Leu Leu Met Val Asn Glu Ala Thr Arg Phe Gln
1 5 10 15

Thr Val Ser Gly Phe Val Ala Gly Leu Leu His Pro Lys Ala Val Glu
20 25 30

Lys Lys Ser Gly Lys Ile Gly Asn Glu Met Lys Ala Gln Val Asn Gly
35 40 45

Trp Gln Asp Leu Ser Ala Ala Leu Leu Lys Thr Asp Val Lys Pro Pro
50 55 60

Pro Gly Lys Ser Pro Ala Lys Phe Thr Pro Ile Glu Lys Met Gly Val
65 70 75 80

Arg Thr Ala Glu Gln Ala Ala Ala Thr Leu Gly Ile Leu Leu Phe Val
85 90 95

Glu Val Pro Gly Gly Leu Thr Val Ala Lys Ala Leu Glu Leu Phe His
100 105 110

Ala Ser Gly Gly Lys
115

( 2 ) INFORMATION FOR SEQ ID NO: 8: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 2329 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( v i ) ORIGINAL SOURCE: 

( A ) ORGANISM: Serratia marcescens

( v i i ) IMMEDIATE SOURCE:

( A ) LIBRARY: Cosmid bank from Serratia marcescens

( i x ) FEATURE:

( A ) NAME/KEY: misc_feature
( B ) LOCATION: 1..2329
( C ) IDENTIFICATION METHOD: experimental
( D ) OTHER INFORMATION: /function= "exo-chitinase"
/product= "ChiS protein"
/evidence= EXPERIMENTAL
/note= "sequence listing of the ChiS gene from a
plasmid pLChiS from E. coli A 5187"

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CAGGGCCTTG TCAATAATGA CAACACCCTG GCTGAAGAGT GTGGTGCAAT ACTGATAAAT 60 

ATTTATCTTT CCTTAATAGA AAATTCACTA TCCTTATTTG TCATGTTTTC TTTTATTTAT 120 

ATGAAAATAA ATTCACGCTT GCTGAATAAA ACCCAGTTGA TAGCGCTCTT GTTTTTGCGC 180 

CTTTTTTATT TATAGTACTG AATGTACGCG GTGGGAATGA TTATTTCGCC ACGTGGAAAG 240 

ACGCTGTTGT TATTTATTGA TTTTAACCTT CGCGGATTAT TGCGGAATTT TTTCGCTTCG 300 

GCAATGCATC GCCACGATTA ACTCTTTTAT GTTTATCCTC TCGGAATAAA GGAATCAGTT 360

ATGCGCAAAT TTAATAAACC GCTGTTGGCG CTGTTGATCG GCAGCACGCT GTGTTCCGCG 420 

GCGCAGGCCG CCGCGCCGGG CAAGCCGACC ATCGCCTGGG GCAACACCAA GTTCGCCATC 480

GTTGAAGTTG ACCAGGCGGC TACCGCTTAT AATAATTTGG TGAAGGTAAA AAATGCCGCC 540 

GATGTTTCCG TCTCCTGGAA TTTATGGAAT GGCGACACCG GCACGACGGC AAAAGTTTTA 600 

TTAAATGGCA AAGAGGCGTG GAGTGGTCCT TCAACCGGAT CTTCCGGTAC GGCGAATTTT 660 

AAAGTGAATA AAGGCGGCCG TTATCAAATG CAGGTGGCAC TGTGCAATGC CGACGGCTGC 720 

ACCGCCAGTG ACGCCACCGA AATTGTGGTA GCCGACACCG ACGGCAGCCA TTTGGCGCCG 780 

TTGAAAGAGC CGCTGCTGGA AAAGAATAAA CCGTATAAAC AGAACTCCGG CAAAGTGGTC 840 

GGTTCTTATT TCGTCGAGTG GGGCGTTTAC GGGCGCAATT TCACCGTCGA CAAGATCCCG 900

GCGCAAAACC TGACCCACCT GCTGTACGGC TTTATCCCGA TCTGCGGCGG CAATGGCATC 960 

AACGACAGCC TGAAAGAGAT TGAAGGCAGC TTCCAGGCGT TGCAGCGCTC CTGCCAGGGC 1020 

CGCGAGGACT TCAAAGTCTC GATCCACGAT CCGTTCGCCC CGCTGCAAAA AGCGCAGAAG 1080 

GGCGTGACCG CCTGGGATGA CCCCTACAAG GGCAACTTCG GCCAGCTGAT GGCGCTGAAG 1140 

CAGGCGCATC CTGACCTGAA AATCCTGCCG TCGATCGGCG GCTGGACGCT GTCCGACCCG 1200

TTCTTCTTCA TGGGCGACAA GGTGAAGCGC GATCGCTTCG TCGGTTCGGT GAAAGAGTTC 1260

CTGCAGACCT GGAAGTTCTT CGACGGCGTG GATATCGACT GGGAGTTCCC GGGCGGCAAA 1320

GGCGCCAACC CTAACCTGGG CAGCCCGCAA GACGGGGAAA CCTATGTGCT GCTGATGAAG 1380

GAGCTGCGGG CGATGCTGGA TCAGCTGTCG GTGGAAACCG GCCGCAAGTA TGAGCTGACC 1440 

TCCGCCATCA GCGCCGGTAA GGACAAGATC GACAAGGTGG CTTACAACGT TGCGCAGAAC 1500 

TCGATGGATC ACATCTTCCT GATGAGCTAC GACTTCTATG GCGCCTTCGA TCTGAAGAAC 1560 

CTGGGGCATC AGACCGCGCT GAATGCGCCG GCCTGGAAAC CGGACACCGC CTACACCACG 1620 

GTGAACGGCG TCAATGCGCT GCTGGCGCAG GGCGTCAAGC CGGGCAAAAT CGTCGTCGGC 1680 

ACCGCCATGT ATGGCCGCGG CTGGACCGGG GTGAACGGCT ACCAGAACAA TATTCCGTTC 1740 

ACCGGCACCG CCACCGGGCC GGTTAAAGGC ACCTGGGAGA ACGGTATCGT GGACTACCGC 1800 

CAAATCGCCG GCCAGTTCAT GAGCGGCGAG TGGCAGTATA CCTACGACGC CACGGCGGAA 1860

GCGCCTTACG TGTTCAAACC TTCCACCGGC GATCTGATCA CCTTCGACGA TGCCCGCTCG 1920

GTGCAGGCTA AAGGCAAGTA CGTGTTGGAT AAGCAGCTGG GCGGCCTGTT CTCCTGGGAG 1980 

ATCGACGCGG ATAACGGCGA TATTCTCAAC AGCATGAACG CCAGCCTGGG CAACAGCGCC 2040 

GGCGTTCAAT AATCGGTTGC AGTGGTTGCC GGGGGATATC CTTTCGCCCC CGGCTTTTTC 2100

GCCGACGAAA GTTTTTTTAC GCCGCACAGA TTGTGGCTCT GCCCCGAGCA AAACGCGCTC 2160

ATCGGACTCA CCCTTTTGGG TAATCCTTCA GCATTTCCTC CTGTCTTTAA CGGCGATCAC 2220

AAAAATAACC GTTCAGATAT TCATCATTCA GCAACAAAGT TTTGGCGTTT TTTAACGGAG 2280

TTAAAAACCA GTAAOTTTGT GAGCGTCACA CCAATGCGCT AAAAATCCG 2329

( 2 ) INFORMATION FOR SEQ ID NO: 9: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 1002 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: cDNA

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Hordeum vulgare
( B ) STRAIN: L.

( i x ) FEATURE:

( A ) NAME/KEY: 5'UTR
( B ) LOCATION: 1..63

( i x ) FEATURE:

( A ) NAME/KEY: CDS
( B ) LOCATION: 64..861
( D ) OTHER INFORMATION: /codon_start= 64
/function= "chitinase"
/product= "26 kD preprotein of chitinase G (ChiG)"
/note= "antifungal activity, especially on
Trichoderma reesii and Fusarium sporotrichoides as
well as Rhizoctonia solani and Botrytis cinerea."

( i x ) FEATURE:

( A ) NAME/KEY: 3'UTR
( B ) LOCATION: 862..1002
( D ) OTHER INFORMATION: /partial
/note= "11 nucleotides at 3'end not shown"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 905..910
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: sig_peptide
( B ) LOCATION: 64..294
( D ) OTHER INFORMATION: /note= "probable signal peptide
sequence"

( i x ) FEATURE:

( A ) NAME/KEY: sig_peptide
( B ) LOCATION: 298..312
( D ) OTHER INFORMATION: /note= "probable signal peptide
sequence"

( i x ) FEATURE:

( A ) NAME/KEY: sig_peptide
( B ) LOCATION: 349..378
( D ) OTHER INFORMATION: /note= "probable signal peptide
sequence"

( i x ) FEATURE:

( A ) NAME/KEY: sig_peptide
( B ) LOCATION: 466..588
( D ) OTHER INFORMATION: /note= "probable signal peptide
sequence"

( i x ) FEATURE:

( A ) NAME/KEY: sig_peptide
( B ) LOCATION: 607..861
( D ) OTHER INFORMATION: /note= "probable signal peptide
sequence"

( i x ) FEATURE:

( A ) NAME/KEY: mat_peptide
( B ) LOCATION: 133..861

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CCTACCACAG TAGCGTAACG GTAAACACCG AGTACGGTAC TCTGTGCTTT GTTGGCTCGC 60

ACA ATG AGA TCG CTC GCG GTG GTG GTG GCC GTG GTA GCC ACG GTG GCC 108
Met Arg Ser Leu Ala Val Val Val Ala Val Val Ala Thr Val Ala
-23 -20 -15 -10

ATG GCC ATC GGC ACG GCG CGC GGC AGC GTG TCC TCC ATC GTC TCG CGC 156
Met Ala Ile Gly Thr Ala Arg Gly Ser Val Ser Ser Ile Val Ser Arg
-5 1 5

CCA CAG TTT GAC CGC ATG CTT CTC CAC CGC AAC GAC GGC GCC TGC CAG 204
Ala Gln Phe Asp Arg Met Leu Leu His Arg Asn Asp Gly Ala Cys Gln
10 15 20

GCC AAG GGC TTC TAC ACC TAC GAC GCC TTC GTC GCC GCC GCA GCC GCC 252
Ala Lys Gly Phe Tyr Thr Tyr Asp Ala Phe Val Ala Ala Ala Ala Ala
25 30 35 40

TTC CCG GGC TTC GGC ACC ACC GGC AGC GCC GAC GCC CAG AAG CGC GAG 300
Phe Pro Gly Phe Gly Thr Thr Gly Ser Ala Asp Ala Gln Lys Arg Glu
45 50 55

GTG GCC GCC TTC CTA GCA CAG ACC TCC CAC GAG ACC ACC GGC GGG TGG 348
Val Ala Ala Phe Leu Ala Gln Thr Ser His Glu Thr Thr Gly Gly Trp
60 65 70

GCG ACT GCA CCG GAC GGG GCC TTC GCC TGG GGC TAC TGC TTC AAG CAG 396
Ala Thr Ala Pro Asp Gly Ala Phe Ala Trp Gly Tyr Cys Phe Lys Gln
75 80 85

GAA CGT GGC GCC TCC TCC GAC TAC TGC ACC CCG AGC GCA CAA TGG CCG 444
Glu Arg Gly Ala Ser Ser Asp Tyr Cys Thr Pro Ser Ala Gln Trp Pro
90 95 100

TGC GCC CCC GGG AAG CGC TAC TAC GGC CGC GGG CCA ATC CAC CTC TCC 492
Cys Ala Pro Gly Lys Arg Tyr Tyr Gly Arg Gly Pro Ile Gln Leu Ser
105 110 115 120

CAC AAC TAC AAC TAT GGA CCT GCC GGC CGG GCC ATC GGG GTC GAT CTC 540
His Asn Tyr Asn Tyr Gly Pro Ala Gly Arg Ala Ile Gly Val Asp Leu
125 130 135

CTG GCC AAC CCG GAC CTG GTG GCC ACG GAC GCC ACT GTG GGC TTT AAG 588
Leu Ala Asn Pro Asp Leu Val Ala Thr Asp Ala Thr Val Gly Phe Lys
140 145 150

ACG GCC ATC TGG TTC TGG ATG ACG GCG CAG CCG CCC AAG CCA TCG AGC 636
Thr Ala Ile Trp Phe Trp Met Thr Ala Gln Pro Pro Lys Pro Ser Ser
155 160 165

CAT GCT GTG ATC GCC CGC CAG TGG AGC CCG TCA GGG GCT GAC CGG GCC 684
His Ala Val Ile Ala Gly Gln Trp Ser Pro Ser Gly Ala Asp Arg Ala
170 175 180

GCA GGC CGG GTC CCC GGG TTT GGT GTG ATC ACC AAC ATC ATC AAC GGC 732
Ala Gly Arg Val Pro Gly Phe Gly Val Ile Thr Asn Ile Ile Asn Gly
185 190 195 200

GGG ATC GAG TGC GGT CAC GGG CAG GAC AGC CGC GTC GCC GAT CGA ATC 780
Gly Ile Glu Cys Gly His Gly Gln Asp Ser Arg Val Ala Asp Arg Ile
205 210 215

GGG TTT TAC AAG CGC TAC TGT GAC ATC CTC GGC GTT GGC TAC GGC AAC 828
Gly Phe Tyr Lys Arg Tyr Cys Asp Ile Leu Gly Val Gly Tyr Gly Asn
220 225 230

AAC CTC GAT TGC TAC AGC CAG AGA CCC TTC GCC TAATTAATTA GTCATGTATT 881
Asn Leu Asp Cys Tyr Ser Gln Arg Pro Phe Ala
235 240

AATCTTGGCC CTCCATAAAA TACAATAAGA GCATCGTCTC CTATCTACAT GCTGTAAGAT 941

GTAACTATGG TAACCTTTTA TGGGGAACAT AACAAAGGCA TCTCGTATAG ATGCTTTGCT 1001

A 1002

( 2 ) INFORMATION FOR SEQ ID NO:10: 

( i ) SEQUENCE CHARACTERISTICS: 








( A ) LENGTH: 266 amino acids 
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Arg Ser Leu Ala Val Val Val Ala Val Val Ala Thr Val Ala Met
-23 -20 -15 -10

Ala Ile Gly Thr Ala Arg Gly Ser Val Ser Ser Ile Val Ser Arg Ala
-5 1 5

Gln Phe Asp Arg Met Leu Leu His Arg Asn Asp Gly Ala Cys Gln Ala
10 15 20 25

Lys Gly Phe Tyr Thr Tyr Asp Ala Phe Val Ala Ala Ala Ala Ala Phe
30 35 40

Pro Gly Phe Gly Thr Thr Gly Ser Ala Asp Ala Gln Lys Arg Glu Val
45 50 55

Ala Ala Phe Leu Ala Gln Thr Ser His Glu Thr Thr Gly Gly Trp Ala
60 65 70

Thr Ala Pro Asp Gly Ala Phe Ala Trp Gly Tyr Cys Phe Lys Gln Glu
75 80 85

Arg Gly Ala Ser Ser Asp Tyr Cys Thr Pro Ser Ala Gln Trp Pro Cys
90 95 100 105

Ala Pro Gly Lys Arg Tyr Tyr Gly Arg Gly Pro Ile Gln Leu Ser His
110 115 120

Asn Tyr Asn Tyr Gly Pro Ala Gly Arg Ala Ile Gly Val Asp Leu Leu
125 130 135

Ala Asn Pro Asp Leu Val Ala Thr Asp Ala Thr Val Gly Phe Lys Thr
140 145 150

Ala Ile Trp Phe Trp Met Thr Ala Gln Pro Pro Lys Pro Ser Ser His
155 160 165

Ala Val Ile Ala Gly Gln Trp Ser Pro Ser Gly Ala Asp Arg Ala Ala
170 175 180 185

Gly Arg Val Pro Gly Phe Gly Val Ile Thr Asn Ile Ile Asn Gly Gly
190 195 200

Ile Glu Cys Gly His Gly Gln Asp Ser Arg Val Ala Asp Arg Ile Gly
205 210 215

Phe Tyr Lys Arg Tyr Cys Asp Ile Leu Gly Val Gly Tyr Gly Asn Asn
220 225 230

Leu Asp Cys Tyr Ser Gln Arg Pro Phe Ala
235 240



( 2 ) INFORMATION FOR SEQ ID NO: 11:

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 1235 base pairs
( B ) TYPE: nucleic acid
( C ) STRANDEDNESS: single
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: cDNA

( v i ) ORIGINAL SOURCE:

( A ) ORGANISM: Hordeum vulgare
( B ) STRAIN: L.

( i x ) FEATURE:

( A ) NAME/KEY: 5'UTR
( B ) LOCATION: 1..48

( i x ) FEATURE:

( A ) NAME/KEY: CDS
( B ) LOCATION: 49..1050
( D ) OTHER INFORMATION: /partial
/codon_start= 49
/function= "glucanase"
/product= "preprotein of the glucanase GluG"

( i x ) FEATURE:

( A ) NAME/KEY: 3'UTR
( B ) LOCATION: 1051..1235
( D ) OTHER INFORMATION: /partial
/note= "14 nucleotides at the 3'end not shown."

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 1083..1088
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: polyA_signal
( B ) LOCATION: 1210..1215
( D ) OTHER INFORMATION: /note= "potential polyadenylation
signal"

( i x ) FEATURE:

( A ) NAME/KEY: mat_peptide
( B ) LOCATION: 133..1050

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCAGCATTG CATACCATTT GAGCACCAGA TACTCCGTGT CTGCACCA ATG GCT AGA 57
Met Ala Arg
-28

AAA GAT GTT GCC TCC ATG TTT GCA GTT GCT CTC TTC ATT GGA GCA TTC 105
Lys Asp Val Ala Ser Met Phe Ala Val Ala Leu Phe Ile Gly Ala Phe
-25 -20 -15 -10

GCT GCT GTT CCT ACG AGT GTG CAG TCC ATC GGC GTA TGC TAC GGC GTG 153
Ala Ala Val Pro Thr Ser Val Gln Ser Ile Gly Val Cys Tyr Gly Val
-5 1 5

ATC GGC AAC AAC CTC CCC TCC CGG AGC GAC GTG GTG CAG CTC TAC AGG 201
Ile Gly Asn Asn Leu Pro Ser Arg Ser Asp Val Val Gln Leu Tyr Arg
10 15 20

TCC AAG GGC ATC AAC GGC ATG CGC ATC TAC TTC GCC GAC GGG CAG GCC 249
Ser Lys Gly Ile Asn Gly Met Arg Ile Tyr Phe Ala Asp Gly Gln Ala
25 30 35

CTC TCG GCC GTC CCC AAC TCC GGC ATC GGC CTC ATC CTC GAC ATC GGC 297
Leu Ser Ala Val Arg Asn Ser Gly Ile Gly Leu Ile Leu Asp Ile Gly
40 45 50 55

AAC GAC CAG CTC GCC AAC ATC GCC GCC AGC ACC TCC AAC GCG GCC TCC 345
Asn Asp Gln Leu Ala Asn Ile Ala Ala Ser Thr Ser Asn Ala Ala Ser
60 65 70

TGG GTC CAG AAC AAC GTG CGG CCC TAC TAC CCT GCC GTG AAC ATC AAG 393
Trp Val Gln Asn Asn Val Arg Pro Tyr Tyr Pro Ala Val Asn Ile Lys
75 80 85

TAC ATC GCC GCC GGC AAC GAG GTG CAG GGC GGC GCC ACG CAG AGC ATC 441
Tyr Ile Ala Ala Gly Asn Glu Val Gln Gly Gly Ala Thr Gln Ser Ile
90 95 100

CTG CCG GCC ATG CGC AAC CTC AAC GCG GCC CTC TCC GCG GCG GGG CTC 489
Leu Pro Ala Met Arg Asn Leu Asn Ala Ala Leu Ser Ala Ala Gly Leu
105 110 115

GGC GCC ATC AAG GTG TCC ACC TCC ATC CGG TTC GAC GAG GTG GCC AAC 537
Gly Ala Ile Lys Val Ser Thr Ser Ile Arg Phe Asp Glu Val Ala Asn
120 125 130 135

TCC TTC CCG CCC TCC GCC GGC GTG TTC AAG AAC GCC TAC ATG ACG GAC 585
Ser Phe Pro Pro Ser Ala Gly Val Phe Lys Asn Ala Tyr Met Thr Asp
140 145 150

GTG GCC CGG CTC CTG GCG AGC ACC GGC GCG CCG CTG CTC GCC AAC GTC 633
Val Ala Arg Leu Leu Ala Ser Thr Gly Ala Pro Leu Leu Ala Asn Val
155 160 165

TAC CCC TAC TTC GCG TAC CGT GAC AAC CCC GGG AGC ATC AGC CTG AAC 681
Tyr Pro Tyr Phe Ala Tyr Arg Asp Asn Pro Gly Ser Ile Ser Leu Asn
170 175 180

TAC GCG ACG TTC CAG CCG GGC ACC ACC GTG CGT GAC CAG AAC AAC GGG 729
Tyr Ala Thr Phe Gln Pro Gly Thr Thr Val Arg Asp Gln Asn Asn Gly
185 190 195

CTG ACC TAC ACG TCC CTG TTC GAC GCG ATG GTG GAC GCC GTG TAC GCG 777
Leu Thr Tyr Thr Ser Leu Phe Asp Ala Met Val Asp Ala Val Tyr Ala
200 205 210 215

GCG CTG GAG AAG GCC GGC GCG CCG GCG GTG AAG GTG GTG GTG TCG GAG 825
Ala Leu Glu Lys Ala Gly Ala Pro Ala Val Lys Val Val Val Ser Glu
220 225 230

AGC GGG TGG CCG TCG GCG GGC GGG TTT GCG GCG TCG GCC GGC AAT GCG 873
Ser Gly Trp Pro Ser Ala Gly Gly Phe Ala Ala Ser Ala Gly Asn Ala
235 240 245

CGG ACG TAC AAC CAG GGG CTG ATC AAC CAC GTC GGC GGG GGC ACG CCC 921
Arg Thr Tyr Asn Gln Gly Leu Ile Asn His Val Gly Gly Gly Thr Pro
250 255 260

AAG AAG CGG GAG GCG CTG GAG ACG TAC ATC TTC GCC ATG TTC AAC GAG 969
Lys Lys Arg Glu Ala Leu Glu Thr Tyr Ile Phe Ala Met Phe Asn Glu
265 270 275

AAC CAG AAG ACC GGG GAC GCC ACG GAG AGG AGC TTC GGG CTC TTC AAC 1017
Asn Gln Lys Thr Gly Asp Ala Thr Glu Arg Ser Phe Gly Leu Phe Asn
280 285 290 295

CCG GAC AAG TCG CCG GCA TAC AAC ATC CAG TTC TAGTACGTGT AGCTACCTAG 1070
Pro Asp Lys Ser Pro Ala Tyr Asn Ile Gln Phe
300 305

CTCACATACC TAAATAAATA AGCTGCACGT ACGTACGTAA TGCGGCATCC AAGTGTAACG 1130 

TAGACACGTA CATTCATCCA TGGAAGAGTG CAACCAAGCA TGCGTTAACT TCCTGGTGAT 1190 

GATACATCAT CATGGTATGA ATAAAAGATA TGGAAGATGT TATGA 1235 

( 2 ) INFORMATION FOR SEQ ID NO: 12:

( i ) SEQUENCE CHARACTERISTICS:

( A ) LENGTH: 334 amino acids
( B ) TYPE: amino acid
( D ) TOPOLOGY: linear

( i i ) MOLECULE TYPE: protein

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Ala Arg Lys Asp Val Ala Ser Met Phe Ala Val Ala Leu Phe Ile
-28 -25 -20 -15

Gly Ala Phe Ala Ala Val Pro Thr Ser Val Gln Ser Ile Gly Val Cys
-10 -5 1

Tyr Gly Val Ile Gly Asn Asn Leu Pro Ser Arg Ser Asp Val Val Gln
5 10 15 20

Leu Tyr Arg Ser Lys Gly Ile Asn Gly Met Arg Ile Tyr Phe Ala Asp
25 30 35

Gly Gln Ala Leu Ser Ala Val Arg Asn Ser Gly Ile Gly Leu Ile Leu
40 45 50

Asp Ile Gly Asn Asp Gln Leu Ala Asn Ile Ala Ala Ser Thr Ser Asn
55 60 65

Ala Ala Ser Trp Val Gln Asn Asn Val Arg Pro Tyr Tyr Pro Ala Val
70 75 80




SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT : 



(A) NAME: Logemann, Juergen
(B) STREET: Lavendeltuin 5
(C) CITY: NB Leiden
(E) COUNTRY: The Netherlands
(F) POSTAL CODE (ZIP): 2317

(A) NAME: Jach, Guido
(B) STREET: Maternusstrasse 22
(C) CITY: Koeln
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): 50678

(A) NAME: Goernhardt, Birgit
(B) STREET: Auf dem Knoepp 28
(C) CITY: Koeln
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): 51145

(A) NAME: Mundy, John
(B) STREET: NY Carlsberg Vej 6, 4th
(C) CITY: V Copenhagen
(E) COUNTRY: Denmark
(F) POSTAL CODE (ZIP): 1760

(A) NAME: Schell, Jeff
(B) STREET: Carl-von-Linne-Weg 10
(C) CITY: Koeln
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): 50829

(A) NAME: Eckes, Peter
(B) STREET: Am Flachsland 18
(C) CITY: Kelkheim (Taunus)
(E) COUNTRY: Germany
(F) POSTAL CODE (ZIP): 65779

(A) NAME: Chet, Ilan
(B) STREET: Shikun Ezrachi
(C) CITY: Nes Ziona
(E) COUNTRY: Israel
(F) POSTAL CODE (ZIP): 70400



(ii) TITLE OF INVENTION: Transgenic pathogen-resistant organism 



(iii) NUMBER OF SEQUENCES: 12 



(iv) CORRESPONDENCE ADDRESS: 





(A) ADDRESSEE: Baker & Botts 

(B) STREET: 30 Rockefeller Plaza 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: U.S.A. 

(F) ZIP: 10112 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: reissue application of U.S. Patent No. 5,804,184 
(B) FILING DATE: herewith 

(vii) ATTORNEY /AGENT INFORMATION: 



(A) ATTORNEY NAME: Tenser, Arthur 

(B) REGISTRATION NUMBER: 18,839 

(C) AGENT NAME: Kole, Lisa 

(D) REGISTRATION NUMBER: 35,225 

(E) REFERENCE NO: A29542 FWC I- 37/31335 



(viii) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (212) 408-2500

(B) TELEFAX: (212) 765-2519

(C) TELEX: 238555

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 275 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus giganteus

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..45

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 46..225

(C) IDENTIFICATION METHOD: experimental

(D) OTHER INFORMATION: /codon_start= 46

/function= "antifungal agent"

/product= "antifungal peptide)"

/evidence= EXPERIMENTAL

/note= "antifungal agent, especially on

Rhizoctonia solani, various Aspergillus, Fusaria

and Trichophyton species"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TTGCCACCCC CGTTGAAGCC GATTCTCTCA CCGCTGGTGG TCTGG ATG CAA GAG 54
Met Gln Glu
1

ATG AGA GCG CGG GTT TTG GCC ACA TAC AAT GGC AAA TGC TAC AAG AAG 102
Met Arg Ala Arg Val Leu Ala Thr Tyr Asn Gly Lys Cys Tyr Lys Lys
5 10 15

GAT AAT ATC TGC AAG TAC AAG GCA CAG AGC GGC AAG ACT GCC ATT TGC 150
Asp Asn Ile Cys Lys Tyr Lys Ala Gln Ser Gly Lys Thr Ala Ile Cys
20 25 30 35

AAG TGC TAT GTC AAA AAG TGC CCC CGC GAC GGC GCG AAA TGC GAG TTT 198
Lys Cys Tyr Val Lys Lys Cys Pro Arg Asp Gly Ala Lys Cys Glu Phe
40 45 50

GAC AGC TAC AAG GGG AAG TGC TAC TGC TAGACGGTGA GCGAAGGGAC 245
Asp Ser Tyr Lys Gly Lys Cys Tyr Cys
55 60

GAAGTAGGCT GGGGGTTATT TTACTCTGCT 275
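
As a consistency check on SEQ ID NO: 1, the CDS feature (positions 46..225) can be translated with the standard genetic code and compared with the peptide given as SEQ ID NO: 2. The following Python sketch is not part of the original listing; the helper `translate` and the variable names are illustrative assumptions, and the nucleotide text is copied from the rows above with the printed spacing left in place.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Nucleotide rows of SEQ ID NO: 1 as printed; whitespace is removed below.
SEQ_ID_1_TEXT = """
TTGCCACCCC CGTTGAAGCC GATTCTCTCA CCGCTGGTGG TCTGG ATG CAA GAG
ATG AGA GCG CGG GTT TTG GCC ACA TAC AAT GGC AAA TGC TAC AAG AAG
GAT AAT ATC TGC AAG TAC AAG GCA CAG AGC GGC AAG ACT GCC ATT TGC
AAG TGC TAT GTC AAA AAG TGC CCC CGC GAC GGC GCG AAA TGC GAG TTT
GAC AGC TAC AAG GGG AAG TGC TAC TGC TAGACGGTGA GCGAAGGGAC
GAAGTAGGCT GGGGGTTATT TTACTCTGCT
"""


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


seq1 = "".join(SEQ_ID_1_TEXT.split())
cds = seq1[45:225]            # 1-based positions 46..225 from the CDS feature
protein = translate(cds)
stop_codon = seq1[225:228]    # the codon immediately following the CDS

print(len(seq1), len(protein), stop_codon)
print(protein)
```

If the printed rows are transcribed correctly, the sequence length should agree with the stated 275 base pairs, the translation should have the 60 residues stated for SEQ ID NO: 2, and the codon after the CDS should be a stop codon.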



(3) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Gln Glu Met Arg Ala Arg Val Leu Ala Thr Tyr Asn Gly Lys Cys
1 5 10 15

Tyr Lys Lys Asp Asn Ile Cys Lys Tyr Lys Ala Gln Ser Gly Lys Thr
20 25 30

Ala Ile Cys Lys Cys Tyr Val Lys Lys Cys Pro Arg Asp Gly Ala Lys
35 40 45

Cys Glu Phe Asp Ser Tyr Lys Gly Lys Cys Tyr Cys
50 55 60



(4) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus giganteus 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..51

(D) OTHER INFORMATION: /note= "active protein fragment of
AFP"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ala Thr Tyr Asn Gly Lys Cys Tyr Lys Lys Asp Asn Ile Cys Lys Tyr
1 5 10 15

Lys Ala Gln Ser Gly Lys Thr Ala Ile Cys Lys Cys Tyr Val Lys Lys
20 25 30

Cys Pro Arg Asp Gly Ala Lys Cys Glu Phe Asp Ser Tyr Lys Gly Lys
35 40 45

Cys Tyr Cys
50
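
SEQ ID NO: 3 is described as a C-terminal fragment, which suggests it should coincide with the last 51 residues of SEQ ID NO: 2. The Python sketch below, which is not part of the original listing, converts both printed sequences from three-letter to one-letter codes and checks that relationship. The function `to_one_letter` and the variable names are illustrative assumptions, and the residues are written with standard spellings (Gln, Ile).

```python
# Three-letter to one-letter amino acid codes (standard IUPAC abbreviations).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(text: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    return "".join(THREE_TO_ONE[token] for token in text.split())


# Residues of SEQ ID NO: 2 as listed (60 amino acids).
SEQ_ID_2 = """
Met Gln Glu Met Arg Ala Arg Val Leu Ala Thr Tyr Asn Gly Lys Cys
Tyr Lys Lys Asp Asn Ile Cys Lys Tyr Lys Ala Gln Ser Gly Lys Thr
Ala Ile Cys Lys Cys Tyr Val Lys Lys Cys Pro Arg Asp Gly Ala Lys
Cys Glu Phe Asp Ser Tyr Lys Gly Lys Cys Tyr Cys
"""

# Residues of SEQ ID NO: 3 as listed (51 amino acids).
SEQ_ID_3 = """
Ala Thr Tyr Asn Gly Lys Cys Tyr Lys Lys Asp Asn Ile Cys Lys Tyr
Lys Ala Gln Ser Gly Lys Thr Ala Ile Cys Lys Cys Tyr Val Lys Lys
Cys Pro Arg Asp Gly Ala Lys Cys Glu Phe Asp Ser Tyr Lys Gly Lys
Cys Tyr Cys
"""

full_peptide = to_one_letter(SEQ_ID_2)
fragment = to_one_letter(SEQ_ID_3)
offset = full_peptide.find(fragment)  # 0-based start of the fragment, -1 if absent

print(len(full_peptide), len(fragment), offset)
print(full_peptide.endswith(fragment))
```

A lookup that raises `KeyError` on an unknown token is deliberate here: a misspelled residue in a transcribed listing is reported immediately instead of being silently dropped.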



(5) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 




(vi) ORIGINAL SOURCE:

(A) ORGANISM: Hordeum vulgare

(B) STRAIN: L. cv. Piggy

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA gene bank in lambda-gt-11-phages

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..42

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 43..885

(D) OTHER INFORMATION: /codon_start= 43
/function= "antifungal activity"
/product= "protein synthesis inhibitor (PSI)"
/note= "antifungal activity, especially on spores
of Trichoderma reesii and Fusarium sporotrichoides
and on Rhizoctonia solani."

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 886..1032

(D) OTHER INFORMATION: /partial

/note= "46 nucleotides at the 3'-end not shown."

(ix) FEATURE:

(A) NAME/KEY: polyA_signal

(B) LOCATION: 930..935

(D) OTHER INFORMATION: /note= "potential polyadenylation
signal"

(ix) FEATURE:

(A) NAME/KEY: polyA_signal

(B) LOCATION: 963..976

(D) OTHER INFORMATION: /note= "potential polyadenylation
signal"

(ix) FEATURE:

(A) NAME/KEY: polyA_signal

(B) LOCATION: 1002..1011

(D) OTHER INFORMATION: /note= "potential polyadenylation
signal"

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 46..886

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CTTAATAGCA CATCTTGTCC GTCTTAGCTT TGCATTACAT CC ATG GCG GCA AAG 54



